Looking over the rim: algorithms for cheminformatics from computer scientists
Abstract
In recent years, a number of methods have been invented in the data mining and machine learning field that have received little attention in the cheminformatics world, even though they offer interesting properties for cheminformatics applications, in some cases compared favourably with similar algorithms published primarily in the cheminformatics literature. In this talk we highlight three of these algorithms and approaches. The first is MoSS [1], a frequent subgraph miner that can be used not only to find common substructures in a set of molecules but also to compute the maximum common substructure (MCSS) very quickly, and that has some extensions especially suited to molecules. The second approach deals with the problem of finding diverse subsets of molecules [2]. Interestingly, finding a diverse subset is not the only challenging task: the definition of diversity itself is not as straightforward as it seems at first glance. The third approach goes along the same lines but tries to find similar molecules by looking at their properties from so-called parallel universes [3]. Each universe contains a set of related properties, and partial predictive models are built in each universe separately. Through interactive model construction, e.g. with so-called Neighbourgrams, the models from one universe can aid the construction of models in other universes.
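To make the point about defining diversity concrete, the following is a minimal sketch of one common heuristic, greedy MaxMin selection over Tanimoto distances. It is not the method of [2]; the function names, the toy fingerprints (plain sets of feature identifiers) and the two scoring functions are assumptions introduced here for illustration only.

```python
from itertools import combinations


def tanimoto_distance(a, b):
    """Return 1 - |a & b| / |a | b| for two feature sets (0.0 if both are empty)."""
    union = len(a | b)
    if union == 0:
        return 0.0
    return 1.0 - len(a & b) / union


def maxmin_pick(fingerprints, k, first=0):
    """Greedy MaxMin: repeatedly add the item farthest from its nearest picked item."""
    if not 0 < k <= len(fingerprints):
        raise ValueError("k must be between 1 and the number of fingerprints")
    picked = [first]
    # nearest[i] is the distance from item i to the closest item picked so far.
    nearest = [tanimoto_distance(fp, fingerprints[first]) for fp in fingerprints]
    while len(picked) < k:
        remaining = (i for i in range(len(fingerprints)) if i not in picked)
        candidate = max(remaining, key=lambda i: nearest[i])
        picked.append(candidate)
        for i, fp in enumerate(fingerprints):
            nearest[i] = min(nearest[i], tanimoto_distance(fp, fingerprints[candidate]))
    return picked


def min_pairwise_distance(fingerprints, subset):
    """One possible diversity score: the smallest distance within the subset."""
    return min(tanimoto_distance(fingerprints[i], fingerprints[j])
               for i, j in combinations(subset, 2))


def mean_pairwise_distance(fingerprints, subset):
    """Another possible diversity score: the average distance within the subset."""
    pairs = list(combinations(subset, 2))
    return sum(tanimoto_distance(fingerprints[i], fingerprints[j])
               for i, j in pairs) / len(pairs)


# Hypothetical toy fingerprints; real ones would come from a cheminformatics toolkit.
toy_fingerprints = [
    {1, 2, 3, 4},
    {1, 2, 3, 5},
    {6, 7, 8, 9},
    {1, 2, 8, 9},
    {10, 11, 12},
]
subset = maxmin_pick(toy_fingerprints, k=3)
```

The two scoring functions can rank candidate subsets differently, which is one way to see why "diversity" needs an explicit definition before a selection algorithm can be judged.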
Similar resources
Feasibility of detecting and localizing radioactive source using image processing and computational geometry algorithms
We consider the problem of finding the localization of radioactive source by using data from a digital camera. In other words, the camera could help us to detect the direction of radioactive rays radiation. Therefore, the outcome could be used to command a robot to move toward the true direction to achieve the source. The process of camera data is performed by using image processing and computa...
An improved opposition-based Crow Search Algorithm for Data Clustering
Data clustering is an ideal way of working with a huge amount of data and looking for a structure in the dataset. In other words, clustering is the classification of the same data; the similarity among the data in a cluster is maximum and the similarity among the data in the different clusters is minimal. The innovation of this paper is a clustering method based on the Crow Search Algorithm (CS...
Of possible cheminformatics futures
For over a decade, cheminformatics has contributed to a wide array of scientific tasks from analytical chemistry and biochemistry to pharmacology and drug discovery; and although its contributions to decision making are recognized, the challenge is how it would contribute to faster development of novel, better products. Here we address the future of cheminformatics with primary focus on innovat...
Multicast computer network routing using genetic algorithm and ant colony
Due to the growth and development of computer networks, the importance of the routing topic has been increased. The importance of the use of multicast networks is not negligible nowadays. Many of multimedia programs need to use a communication link to send a packet from a sender to several receivers. To support such programs, there is a need to make an optimal multicast tree to indicate the opt...
The Chemical Information Ontology: Provenance and Disambiguation for Chemical Data on the Biological Semantic Web
Cheminformatics is the application of informatics techniques to solve chemical problems in silico. There are many areas in biology where cheminformatics plays an important role in computational research, including metabolism, proteomics, and systems biology. One critical aspect in the application of cheminformatics in these fields is the accurate exchange of data, which is increasingly accompli...